A fast point-based algorithm for POMDPs

نویسندگان

  • Nikos Vlassis
  • Matthijs T. J. Spaan
چکیده

We describe a point-based approximate value iteration algorithm for partially observable Markov decision processes. The algorithm performs value function updates ensuring that in each iteration the new value function is an upper bound to the previous value function, as estimated on a sampled set of belief points. A randomized belief-point selection scheme allows for fast update steps. Results indicate that the proposed algorithm achieves competitive performance, both in terms of solution quality

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Applying Metric-Trees to Belief-Point POMDPs

Recent developments in grid-based and point-based approximation algorithms for POMDPs have greatly improved the tractability of POMDP planning. These approaches operate on sets of belief points by individually learning a value function for each point. In reality, belief points exist in a highly-structured metric simplex, but current POMDP algorithms do not exploit this property. This paper pres...

متن کامل

A New Maximum Power Point Tracking Method for PEM Fuel Cells Based On Water Cycle Algorithm

Maximum Power Point (MPP) tracker has an important role in the performance of fuel cell (FC) systems improvement. Tow parameters which have effect on the Fuel cell output power are temperature and membrane water. So contents make the MPP change by with variations in each parameter. In this paper, a new maximum power point tracking (MPPT) method for Proton Exchange Membrane (PEM) fuel cell is pr...

متن کامل

Solving POMDPs: RTDP-Bel vs. Point-based Algorithms

Point-based algorithms and RTDP-Bel are approximate methods for solving POMDPs that replace the full updates of parallel value iteration by faster and more effective updates at selected beliefs. An important difference between the two methods is that the former adopt Sondik’s representation of the value function, while the latter uses a tabular representation and a discretization function. The ...

متن کامل

Anytime Point Based Approximations for Interactive POMDPs

Partially observable Markov decision processes (POMDPs) have been largely accepted as a rich-framework for planning and control problems. In settings where multiple agents interact POMDPs prove to be inadequate. The interactive partially observable Markov decision process (I-POMDP) is a new paradigm that extends POMDPs to multiagent settings. The added complexity of this model due to the modeli...

متن کامل

Planning under Uncertainty

In a fast point-based algorithm for POMDPs (see [Vlassis, Spaan]) the proposed algorithm is tested using various experiments. One of these experiments concerns the Tag domain as proposed in [Pineau, Gordon, Thrun]. The goal in this domain is to search for a moving robot and tag it. All the predator robot observes is its own square. We propose a method to increase this observation model, thereby...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004